On the statistical significance of nucleic acid similarities
نویسندگان
چکیده
When evaluating sequence similarities among nucleic acids by the usual methods, statistical significance is often found when the biological significance of the similarity is dubious. We demonstrate that the known statistical properties of nucleic acid sequences strongly affect the statistical distribution of similarity values when calculated by standard procedures. We propose a series of models which account for some of these known statistical properties. The utility of the method is demonstrated in evaluating high relative similarity scores in four specific cases in which there is little biological context by which to judge the similarities. In two of the cases we identify the statistical properties which are responsible for the apparent similarity. In the other two cases the statistical significance of the similarity persists even when the known statistical properties of sequences are modelled. For one of these cases biological significance is likely while the other case remains an enigma.
منابع مشابه
The statistical distribution of nucleic acid similarities.
All pairs of a large set of known vertebrate DNA sequences were searched by computer for most similar segments. Analysis of this data shows that the computed similarity scores are distributed proportionally to the logarithm of the product of the lengths of the sequences involved. This distribution is closely related to recent results of Erdos and others on the longest run of heads in coin tossi...
متن کاملRetrieval accuracy, statistical significance and compositional similarity in protein sequence database searches
Protein sequence database search programs may be evaluated both for their retrieval accuracy--the ability to separate meaningful from chance similarities--and for the accuracy of their statistical assessments of reported alignments. However, methods for improving statistical accuracy can degrade retrieval accuracy by discarding compositional evidence of sequence relatedness. This evidence may b...
متن کاملStatistical and Practical Significance of Articles at Sports Biomechanics Conferences
Background. The importance of using statistical approaches has increased and became necessary for researchers and specialists in sports biomechanics because they need more objective and accurate methods to increase knowledge. Objectives. Evaluate the reality of using practical significance in the articles published in scientific conferences in the biomechanical sport. Methods. One hundred twe...
متن کاملCellular Morphology and Immunologic Properties of Escherichia coli Treated With Antimicrobial Antisense Peptide Nucleic Acid
Background & Objectives: Antisense peptide nucleic acids (PNA) that target growth essential genes show potent bactericidal properties without cell lysis. We considered the possibility that whether PNA treatment influence the bacteria total nucleic acids content and apply approach to develop a new delivery system to Dendritic cells (DCs). DCs are the most potent antigen presenting cells in th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nucleic acids research
دوره 12 1 Pt 1 شماره
صفحات -
تاریخ انتشار 1984